Global organization of the Wordnet lexicon.

Authors

  • Mariano Sigman
  • Guillermo A Cecchi
Abstract

The lexicon consists of a set of word meanings and their semantic relationships. A systematic representation of the English lexicon based on psycholinguistic considerations has been assembled in the Wordnet database through a long-term collaborative effort. We present here a quantitative study of the graph structure of Wordnet to understand the global organization of the lexicon. Semantic links follow power-law, scale-invariant behaviors typical of self-organizing networks. Polysemy (the ambiguity of an individual word) is one of the links in the semantic network, relating the different meanings of a common word. Polysemous links have a profound impact on the organization of the semantic graph, shaping it into a small-world network, with clusters of high traffic (hubs) representing abstract concepts such as line, head, or circle. Our results show that: (i) Wordnet has global properties common to many self-organized systems, and (ii) polysemy organizes the semantic graph in a compact and categorical representation, in a way that may explain the ubiquity of polysemy across languages.
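The kind of measurement the abstract describes can be illustrated with a minimal sketch. It is not the authors' code or data: the word meanings, the link lists, and the function names below are invented for illustration. The sketch builds a tiny semantic graph, counts how many nodes have each degree, and compares the mean shortest path length with and without polysemy links.

```python
from collections import Counter, deque

# Toy data invented for illustration only; not taken from Wordnet.
# Each node is a word meaning, written "word.sense".
HYPERNYM_LINKS = [
    ("entity", "shape"),
    ("entity", "artifact"),
    ("entity", "body_part"),
    ("entity", "person"),
    ("shape", "line.geometry"),
    ("shape", "circle.geometry"),
    ("artifact", "line.rope"),
    ("artifact", "circle.ring"),
    ("body_part", "head.anatomy"),
    ("person", "head.leader"),
]
# Polysemy links join different meanings of the same word.
POLYSEMY_LINKS = [
    ("line.geometry", "line.rope"),
    ("circle.geometry", "circle.ring"),
    ("head.anatomy", "head.leader"),
]


def build_graph(edges):
    """Return an undirected adjacency map {node: set of neighbours}."""
    graph = {}
    for a, b in edges:
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set()).add(a)
    return graph


def degree_histogram(graph):
    """Count how many nodes have each degree."""
    return Counter(len(neighbours) for neighbours in graph.values())


def distances_from(graph, source):
    """Breadth-first search: shortest hop count from source to each reachable node."""
    dist = {source: 0}
    queue = deque([source])
    while queue:
        node = queue.popleft()
        for neighbour in graph[node]:
            if neighbour not in dist:
                dist[neighbour] = dist[node] + 1
                queue.append(neighbour)
    return dist


def mean_path_length(graph):
    """Average shortest path length over all ordered pairs of distinct, connected nodes."""
    total = 0
    pairs = 0
    for source in graph:
        for target, d in distances_from(graph, source).items():
            if target != source:
                total += d
                pairs += 1
    return total / pairs


tree_only = build_graph(HYPERNYM_LINKS)
with_polysemy = build_graph(HYPERNYM_LINKS + POLYSEMY_LINKS)

print("degree histogram (hypernym links only):", dict(degree_histogram(tree_only)))
print("mean path length without polysemy:", mean_path_length(tree_only))
print("mean path length with polysemy:", mean_path_length(with_polysemy))
```

On a graph this small the numbers say nothing about power laws or small-world structure; the sketch only shows how the quantities are computed and how adding links between meanings of one word shortens paths between otherwise distant branches.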

Similar resources

Hindi Subjective Lexicon : A Lexical Resource for Hindi Polarity Classification

With recent developments in web technologies, the share of web content in Hindi is growing at lightning speed. This information can prove very useful for researchers, governments and organizations that want to learn what is on the public's mind and make sound decisions. In this paper, we present a graph-based wordnet expansion method to generate a full (adjective and adverb) subjective lexicon. We used s...

Bringing together over- and under- represented languages: Linking WordNet to the SIL Semantic Domains

We have created an open-source mapping between the SIL’s semantic domains (used for rapid lexicon building and organization for under-resourced languages) and WordNet, the standard resource for lexical semantics in natural language processing. We show that the resources complement each other, and suggest ways in which the mapping can be improved even further. The semantic domains give more gene...

WordNet, EuroWordNet and Global WordNet

1. WordNet. In 1978, George Miller started the development of a database with conceptual relations, as an implementation of a model of the mental lexicon. The database, called WordNet, was organized around the notion of a synset between which semantic relations are expressed. A synset is a set of words with the same part-of-speech that can be interchanged in a certain context. For example, {car; ...

Analysis of Classification Techniques for Mining Reviews Using Lexicon and WordNet Using R

With the exponential growth of social media, i.e. blogs and social networks, organizations and individuals increasingly use reviews from these media for decision making about a product or service. Opinion mining detects whether the emotion of an opinion expressed by a user on Web platforms in natural language is positive or negative. This paper presents extensive exper...

Wordnet Based Lexicon Grammar for Polish

In the paper we present a progress report on a long-term ongoing project concerning the lexicon-grammar of Polish. It is based on our former research focused mainly on morphological dictionaries, text understanding and related tools. By Lexicon Grammars we mean grammatical formalisms which are based on the idea that a sentence is the fundamental unit of meaning and that grammatical information ...

Journal:
  • Proceedings of the National Academy of Sciences of the United States of America

Volume 99, Issue 3

Pages: -

Publication date: 2002